Systems and methods for distributed low density parity check decoding

ABSTRACT

Systems and method relating generally to data processing, and more particularly to systems and methods for utilizing multiple data streams for data recovery from a storage device. In some cases the systems include a low density parity check data decoder circuit including at least a first data decoder engine and a second data decoder engine each electrically coupled to a common circuit. The common circuit is operable to: shift a combination of both a first sub-message from the first data decoder engine and the second sub-message from the second data decoder engine to yield an shifted output, and disaggregate the shifted output to yield a third sub-message to the first data decoder engine and a fourth sub-message to the second decoder engine.

FIELD OF THE INVENTION

The present inventions are related to systems and methods for data processing, and more particularly to systems and methods for performing data decoding.

BACKGROUND

Various data processing systems have been developed including storage systems, cellular telephone systems, and radio transmission systems. In such systems data is transferred from a sender to a receiver via some medium. For example, in a storage system, data is sent from a sender (i.e., a write function) to a receiver (i.e., a read function) via a storage medium. As information is stored and transmitted in the form of digital data, errors are introduced that, if not corrected, can corrupt the data and render the information unusable. The data decoding can become relatively complex making implementation in a high speed system costly and in some cases not possible.

Hence, for at least the aforementioned reasons, there exists a need in the art for advanced systems and methods for data decoding.

BRIEF SUMMARY

The present inventions are related to systems and methods for data processing, and more particularly to systems and methods for performing data decoding.

Various embodiments of the present invention provide data processing systems that include a low density parity check data decoder circuit. The low density parity check decoder circuit includes at least a first data decoder engine and a second data decoder engine each electrically coupled to a common circuit. The first data decoder engine is operable to perform variable node updating for a first portion of a codeword and the second data decoder engine is operable to perform variable node updating for a second portion of the codeword. The common circuit is operable to perform a syndrome calculation on a combination of at least the first portion and the second portion of the codeword.

This summary provides only a general outline of some embodiments of the invention. The phrases “in one embodiment,” “according to one embodiment,” “in various embodiments”, “in one or more embodiments”, “in particular embodiments” and the like generally mean the particular feature, structure, or characteristic following the phrase is included in at least one embodiment of the present invention, and may be included in more than one embodiment of the present invention. Importantly, such phases do not necessarily refer to the same embodiment. Many other embodiments of the invention will become more fully apparent from the following detailed description, the appended claims and the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

A further understanding of the various embodiments of the present invention may be realized by reference to the figures which are described in remaining portions of the specification. In the figures, like reference numerals are used throughout several figures to refer to similar components. In some instances, a sub-label consisting of a lower case letter is associated with a reference numeral to denote one of multiple similar components. When reference is made to a reference numeral without specification to an existing sub-label, it is intended to refer to all such multiple similar components.

FIG. 1 shows a storage system including a distributed architecture low density parity check decoder circuitry in accordance with various embodiments of the present invention;

FIG. 2 depicts a data transmission system including a distributed architecture low density parity check decoder circuitry in accordance with one or more embodiments of the present invention;

FIG. 3 shows a data processing circuit including a distributed architecture low density parity check decoder circuit in accordance with some embodiments of the present invention; and

FIGS. 4 a-4 b depicts one implementation of a distributed architecture low density parity check decoder circuit in accordance with one or more embodiments of the present invention; and

FIG. 5 shows an example physical layout facilitated by a distributed architecture low density parity check decoder circuit in accordance with some embodiments of the present invention.

DETAILED DESCRIPTION OF THE INVENTION

The present inventions are related to systems and methods for data processing, and more particularly to systems and methods for performing data decoding.

Various embodiments of the present invention provide data processing systems that include a low density parity check data decoder circuit. The low density parity check decoder circuit includes at least a first data decoder engine and a second data decoder engine each electrically coupled to a common circuit. The first data decoder engine is operable to perform variable node updating for a first portion of a codeword and the second data decoder engine is operable to perform variable node updating for a second portion of the codeword. The common circuit is operable to perform a syndrome calculation on a combination of at least the first portion and the second portion of the codeword. In some instances of the aforementioned embodiments, the first data decoder engine is not electrically connected to the second data decoder engine. In various instances of the aforementioned embodiments, the number of signal routes directly between the first data decoder engine and the second data decoder engine is less than twenty. In more particular instances, the number of signal routes directly between the first data decoder engine and the second data decoder engine is less than ten.

In various instances of the aforementioned embodiments, the first data decoder engine is operable to: generate a first new check node to variable node message; add a first portion of a codeword to the first new check node to variable node message to yield a first interim output; rearrange the first interim output to yield a first sub-message; and provide the first sub-message to the common circuit. The second data decoder engine is operable to: generate a second new check node to variable node message; add a second portion of the codeword to the second new check node to variable node message to yield a second interim output; rearrange the second interim output to yield a second sub-message; and provide the second sub-message to the common circuit. In some cases, the first data decoder engine includes: a first minimum path calculation circuit operable to calculate a first minimum path between a present node and a next node; and a first new message update circuit operable to generate the first new check node to variable node message based at least in part on the first minimum path. The second data decoder engine includes: a second minimum path calculation circuit operable to calculate a second minimum path between a present node and a next node; and a second new message update circuit operable to generate the second new check node to variable node message based at least in part on the second minimum path.

In one or more cases, the common circuit is operable to perform the syndrome calculation based at least in part on both the first sub-message and the second sub-message to yield a syndrome value. In some such cases, the common circuit is further operable to determine whether processing of a data set represented at least in part by the first sub-message and the second sub-message converged. In particular cases, processing of the data set is determined to converge when the syndrome value equals a defined value. The defined value may be zero. In various cases, the common circuit is further operable to: shift a combination of both the first sub-message and the second sub-message to yield an shifted output; and disaggregate the shifted output to yield a third sub-message to the first data decoder engine and a fourth sub-message to the second decoder engine. In some such cases, the first data decoder engine is further operable to: generate a first old check node to variable node message; subtract the third sub-message from the first old check node to variable node message to yield a third interim output; and normalize the third interim output to an expected range. The second data decoder engine is further operable to: generate a second old check node to variable node message; subtract the fourth sub-message from the second old check node to variable node message to yield a fourth interim output; and normalize the fourth interim output to the expected range.

In some instances of the aforementioned embodiments, the first data decoder circuit and the second data decoder circuit are physically layed out in relation to the common circuit such that at least one side of the common circuit is physically accessible by routes from another circuit. In various instances of the aforementioned embodiments, the data processing system is implemented as an integrated circuit. In some instances of the aforementioned embodiments, the data processing system is incorporated in, for example, a storage device or a data communication device.

Other embodiments of the present invention provide methods for data processing that include providing a distributed architecture data decoder circuit. The low density parity check data decoder circuit including at least a first data decoder engine and a second data decoder engine each electrically coupled to a common circuit. The first data decoder engine is operable to perform variable node updating for a first portion of a codeword and the second data decoder engine is operable to perform variable node updating for a second portion of the codeword, and the common circuit is operable to perform a syndrome calculation on a combination of at least the first portion and the second portion of the codeword. The methods further include: generating a first new check node to variable node message in the first data decoder engine, adding a first portion of a codeword to the first new check node to variable node message to yield a first interim output in the first data decoder engine, rearranging the first interim output to yield a first sub-message in the first data decoder engine, generating a second new check node to variable node message in the second data decoder engine, adding a second portion of the codeword to the second new check node to variable node message to yield a second interim output in the second data decoder engine, rearranging the second interim output to yield a second sub-message in the second data decoder engine, and providing the second sub-message to the common circuit.

In some instances of the aforementioned embodiments, the number of signal routes wires directly between the first data decoder engine and the second data decoder engine is less than twenty. In various instances of the aforementioned embodiments, the common circuit is operable to perform the syndrome calculation based at least in part on both the first sub-message and the second sub-message to yield a syndrome value. In one or more instances of the aforementioned embodiments, the methods further include: shifting a combination of both the first sub-message and the second sub-message to yield an shifted output in the common circuit; disaggregating the shifted output to yield a third sub-message to the first data decoder engine and a fourth sub-message to the second decoder engine in the common circuit; generating a first old check node to variable node message in the first data decoder engine; subtracting the third sub-message from the first old check node to variable node message to yield a third interim output in the first data decoder engine; normalizing the third interim output to an expected range in the first data decoder engine; generating a second old check node to variable node message in the second data decoder engine; subtracting the fourth sub-message from the second old check node to variable node message to yield a fourth interim output in the second data decoder engine; and normalizing the fourth interim output to the expected range in the second data decoder engine.

Turning to FIG. 1, a storage system 100 including a read channel circuit 110 having a distributed architecture low density parity check decoder circuitry in accordance with various embodiments of the present invention. Storage system 100 may be, for example, a hard disk drive. Storage system 100 also includes a preamplifier 170, an interface controller 120, a hard disk controller 166, a motor controller 168, a spindle motor 172, a disk platter 178, and a read/write head 176. Interface controller 120 controls addressing and timing of data to/from disk platter 178. The data on disk platter 178 consists of groups of magnetic signals that may be detected by read/write head assembly 176 when the assembly is properly positioned over disk platter 178. In one embodiment, disk platter 178 includes magnetic signals recorded in accordance with either a longitudinal or a perpendicular recording scheme.

In a typical read operation, read/write head assembly 176 is accurately positioned by motor controller 168 over a desired data track on disk platter 178. Motor controller 168 both positions read/write head assembly 176 in relation to disk platter 178 and drives spindle motor 172 by moving read/write head assembly to the proper data track on disk platter 178 under the direction of hard disk controller 166. Spindle motor 172 spins disk platter 178 at a determined spin rate (RPMs). Once read/write head assembly 176 is positioned adjacent the proper data track, magnetic signals representing data on disk platter 178 are sensed by read/write head assembly 176 as disk platter 178 is rotated by spindle motor 172. The sensed magnetic signals are provided as a continuous, minute analog signal representative of the magnetic data on disk platter 178. This minute analog signal is transferred from read/write head assembly 176 to read channel circuit 110 via preamplifier 170. Preamplifier 170 is operable to amplify the minute analog signals accessed from disk platter 178. In turn, read channel circuit 110 decodes and digitizes the received analog signal to recreate the information originally written to disk platter 178. This data is provided as read data 103 to a receiving circuit. A write operation is substantially the opposite of the preceding read operation with write data 101 being provided to read channel circuit 110. This data is then encoded and written to disk platter 178.

As part of processing the received information, read channel circuit 110 utilizes a distributed architecture low density parity check decoder circuit. In operation, a codeword is divided into multiple parallel portions that are fed to respective data decoder engines. The data decoder engines each include variable node update processors that generate check node to variable node messages and provide minimum path updating circuitry. One variable node processor in each of the respective data decoder engines provides an output to a circuit that is common to all of the data decoder engines. The other variable node update processor in each of the respective data decoder engines receives an input from the common circuit. The common circuit performs syndrome calculation, shifting and convergence checking for the information received from the data decoder engines, and provides both a decoder output in the case of convergence, and feedback to the data decoder engines where convergence fails. By distributing the architecture into multiple decoder engines that operate substantially autonomous from other decoder engines, routing and layout becomes more practical. Read channel circuit 110 may be implemented to include a data processing circuit similar to that set forth below in relation to FIGS. 3 and 4 a-4 b below.

It should be noted that storage system 100 may be integrated into a larger storage system such as, for example, a RAID (redundant array of inexpensive disks or redundant array of independent disks) based storage system. Such a RAID storage system increases stability and reliability through redundancy, combining multiple disks as a logical unit. Data may be spread across a number of disks included in the RAID storage system according to a variety of algorithms and accessed by an operating system as if it were a single disk. For example, data may be mirrored to multiple disks in the RAID storage system, or may be sliced and distributed across multiple disks in a number of techniques. If a small number of disks in the RAID storage system fail or become unavailable, error correction techniques may be used to recreate the missing data based on the remaining portions of the data from the other disks in the RAID storage system. The disks in the RAID storage system may be, but are not limited to, individual storage systems such as storage system 100, and may be located in close proximity to each other or distributed more widely for increased security. In a write operation, write data is provided to a controller, which stores the write data across the disks, for example by mirroring or by striping the write data. In a read operation, the controller retrieves the data from the disks. The controller then yields the resulting read data as if the RAID storage system were a single disk.

A data decoder circuit used in relation to read channel circuit 110 may be, but is not limited to, a low density parity check (LDPC) decoder circuit as are known in the art. Such low density parity check technology is applicable to transmission of information over virtually any channel or storage of information on virtually any media. Transmission applications include, but are not limited to, optical fiber, radio frequency channels, wired or wireless local area networks, digital subscriber line technologies, wireless cellular, Ethernet over any medium such as copper or optical fiber, cable channels such as cable television, and Earth-satellite communications. Storage applications include, but are not limited to, hard disk drives, compact disks, digital video disks, magnetic tapes and memory devices such as DRAM, NAND flash, NOR flash, other non-volatile memories and solid state drives.

In addition, it should be noted that storage system 100 may be modified to include solid state memory that is used to store data in addition to the storage offered by disk platter 178. This solid state memory may be used in parallel to disk platter 178 to provide additional storage. In such a case, the solid state memory receives and provides information directly to read channel circuit 110. Alternatively, the solid state memory may be used as a cache where it offers faster access time than that offered by disk platter 178. In such a case, the solid state memory may be disposed between interface controller 120 and read channel circuit 110 where it operates as a pass through to disk platter 178 when requested data is not available in the solid state memory or when the solid state memory does not have sufficient storage to hold a newly written data set. Based upon the disclosure provided herein, one of ordinary skill in the art will recognize a variety of storage systems including both disk platter 178 and a solid state memory.

Turning to FIG. 2, a data transmission system 291 including a receiver 295 having a distributed architecture low density parity check decoder circuitry in accordance with various embodiments of the present invention. Data transmission system 291 includes a transmitter 293 that is operable to transmit encoded information via a transfer medium 297 as is known in the art. The encoded data is received from transfer medium 297 by a receiver 295. Receiver 295 processes the received input to yield the originally transmitted data.

As part of processing the received information, receiver 295 utilizes a distributed architecture low density parity check decoder circuit. In operation, a codeword is divided into multiple parallel portions that are fed to respective data decoder engines. The data decoder engines each include variable node update processors that generate check node to variable node messages and provide minimum path updating circuitry. One variable node processor in each of the respective data decoder engines provides an output to a circuit that is common to all of the data decoder engines. The other variable node update processor in each of the respective data decoder engines receives an input from the common circuit. The common circuit performs syndrome calculation, shifting and convergence checking for the information received from the data decoder engines, and provides both a decoder output in the case of convergence, and feedback to the data decoder engines where convergence fails. By distributing the architecture into multiple decoder engines that operate substantially autonomous from other decoder engines, routing and layout becomes more practical. Receiver 295 may be implemented to include a data processing circuit similar to that set forth below in relation to FIGS. 3 and 4 a-4 b below.

Turning to FIG. 3, a data processing circuit 300 including a distributed architecture low density parity check decoder circuit 370 in accordance with some embodiments of the present invention. Data processing circuit 300 includes an analog front end circuit 310 that receives an analog signal 305. Analog front end circuit 310 processes analog signal 305 and provides a processed analog signal 312 to an analog to digital converter circuit 314. Analog front end circuit 310 may include, but is not limited to, an analog filter and an amplifier circuit as are known in the art. Based upon the disclosure provided herein, one of ordinary skill in the art will recognize a variety of circuitry that may be included as part of analog front end circuit 310. In some cases, analog signal 305 is derived from a read/write head assembly (not shown) that is disposed in relation to a storage medium (not shown). In other cases, analog signal 305 is derived from a receiver circuit (not shown) that is operable to receive a signal from a transmission medium (not shown). The transmission medium may be wired or wireless. Based upon the disclosure provided herein, one of ordinary skill in the art will recognize a variety of source from which analog input 305 may be derived.

Analog to digital converter circuit 314 converts processed analog signal 312 into a corresponding series of digital samples 316. Analog to digital converter circuit 314 may be any circuit known in the art that is capable of producing digital samples corresponding to an analog input signal. Based upon the disclosure provided herein, one of ordinary skill in the art will recognize a variety of analog to digital converter circuits that may be used in relation to different embodiments of the present invention. Digital samples 316 are provided to an equalizer circuit 320. Equalizer circuit 320 applies an equalization algorithm to digital samples 316 to yield an equalized output 325. In some embodiments of the present invention, equalizer circuit 320 is a digital finite impulse response filter circuit as are known in the art. It may be possible that equalized output 325 may be received directly from a storage device in, for example, a solid state storage system. In such cases, analog front end circuit 310, analog to digital converter circuit 314 and equalizer circuit 320 may be eliminated where the data is received as a digital data input. Equalized output 325 is stored to an input buffer 353 that includes sufficient memory to maintain a number of codewords until processing of that codeword is completed through a data detector circuit 330 and a layered decoder and non-layered reuse circuitry 370including, where warranted, multiple global iterations (passes through both data detector circuit 330 and a distributed architecture low density parity check decoder circuit 370) and/or local iterations (passes through a distributed architecture low density parity check decoder circuit 370 during a given global iteration). An output 357 is provided to data detector circuit 330.

Data detector circuit 330 may be a single data detector circuit or may be two or more data detector circuits operating in parallel on different codewords. Whether it is a single data detector circuit or a number of data detector circuits operating in parallel, data detector circuit 330 is operable to apply a data detection algorithm to a received codeword or data set. In some embodiments of the present invention, data detector circuit 330 is a Viterbi algorithm data detector circuit as are known in the art. In other embodiments of the present invention, data detector circuit 330 is a maximum a posteriori data detector circuit as are known in the art. Of note, the general phrases “Viterbi data detection algorithm” or “Viterbi algorithm data detector circuit” are used in their broadest sense to mean any Viterbi detection algorithm or Viterbi algorithm detector circuit or variations thereof including, but not limited to, bi-direction Viterbi detection algorithm or bi-direction Viterbi algorithm detector circuit. Also, the general phrases “maximum a posteriori data detection algorithm” or “maximum a posteriori data detector circuit” are used in their broadest sense to mean any maximum a posteriori detection algorithm or detector circuit or variations thereof including, but not limited to, simplified maximum a posteriori data detection algorithm and a max-log maximum a posteriori data detection algorithm, or corresponding detector circuits. Based upon the disclosure provided herein, one of ordinary skill in the art will recognize a variety of data detector circuits that may be used in relation to different embodiments of the present invention. In some cases, one data detector circuit included in data detector circuit 330 is used to apply the data detection algorithm to the received codeword for a first global iteration applied to the received codeword, and another data detector circuit included in data detector circuit 330 is operable apply the data detection algorithm to the received codeword guided by a decoded output accessed from a central memory circuit 350 on subsequent global iterations. As described below in relation to FIGS. 4 a-4 b, central memory circuit 350 divides the data sets into a defined number of portions that are processed by respective decoder engines implemented as part of distributed architecture low density parity check decoder circuit 370.

Upon completion of application of the data detection algorithm to the received codeword on the first global iteration, data detector circuit 330 provides a detector output 333. Detector output 333 includes soft data. As used herein, the phrase “soft data” is used in its broadest sense to mean reliability data with each instance of the reliability data indicating a likelihood that a corresponding bit position or group of bit positions has been correctly detected. In some embodiments of the present invention, the soft data or reliability data is log likelihood ratio data as is known in the art. Detector output 333 is provided to a local interleaver circuit 342. Local interleaver circuit 342 is operable to shuffle sub-portions (i.e., local chunks) of the data set included as detected output and provides an interleaved codeword 346 that is stored to central memory circuit 350. Interleaver circuit 342 may be any circuit known in the art that is capable of shuffling data sets to yield a re-arranged data set. Interleaved codeword 346 is stored to central memory circuit 350.

Once distributed architecture low density parity check decoder circuit 370 is available, portions of a previously stored interleaved codeword 346 are accessed from central memory circuit 350 as a stored codeword 386 and globally interleaved by a global interleaver/de-interleaver circuit 384. Global interleaver/de-interleaver circuit 384 may be any circuit known in the art that is capable of globally rearranging codewords. Global interleaver/De-interleaver circuit 384 provides a decoder input 352 as respective portions to respective data decoder engines implemented as part of distributed architecture low density parity check decoder circuit 370. One implementation of distributed architecture low density parity check decoder circuit 370 is set forth below in relation to FIGS. 4 a-4 b. Distributed architecture low density parity check decoder circuit 370 applies a low density parity check decode algorithm to the received data to yield a decoded output 371. As more fully discussed below, application of the low density parity check algorithm includes calculating check node to variable node messages for portions of the data set accessed from central memory 350 in respective data decoder engines that in turn produce an output. The produced output from each of the data decoder engines are provided to a circuit common to all of the data decoder engines. The common circuit performs syndrome calculation, shifting and convergence checking for the information received from the data decoder engines, and provides both a decoder output in the case of convergence, and feedback to the data decoder engines where convergence fails. By distributing the architecture into multiple data decoder engines that operate substantially autonomous from other data decoder engines, routing and layout of data decoder circuit 370 becomes more practical.

In cases where another local iteration (i.e., another pass through data decoder circuit 370) is desired, distributed architecture low density parity check decoder circuit 370 re-applies the data decode algorithm to decoder input 352 guided by decoded output 371. This continues until either a maximum number of local iterations is exceeded or decoded output 371 converges (i.e., completion of standard processing).

Where decoded output 371 fails to converge (i.e., fails to yield the originally written data set) and a number of local iterations through layered distributed architecture low density parity check decoder circuit 370 exceeds a threshold, the resulting decoded output is provided as a decoded output 354 back to central memory circuit 350 where it is stored awaiting another global iteration through a data detector circuit included in data detector circuit 330. Prior to storage of decoded output 354 to central memory circuit 350, decoded output 354 is globally de-interleaved to yield a globally de-interleaved output 388 that is stored to central memory circuit 350. The global de-interleaving reverses the global interleaving earlier applied to stored codeword 386 to yield decoder input 352. When a data detector circuit included in data detector circuit 330 becomes available, a previously stored de-interleaved output 388 is accessed from central memory circuit 350 and locally de-interleaved by a de-interleaver circuit 344. De-interleaver circuit 344 re-arranges decoder output 348 to reverse the shuffling originally performed by interleaver circuit 342. A resulting de-interleaved output 397 is provided to data detector circuit 330 where it is used to guide subsequent detection of a corresponding data set previously received as equalized output 325.

Alternatively, where the decoded output converges (i.e., yields the originally written data set), the resulting decoded output is provided as an output codeword 372 to a de-interleaver circuit 380 that rearranges the data to reverse both the global and local interleaving applied to the data to yield a de-interleaved output 382. De-interleaved output 382 is provided to a hard decision buffer circuit 390 that arranges the received codeword along with other previously received codewords in an order expected by a requesting host processor. The resulting output is provided as a hard decision output 392.

Turning to FIG. 4 a, one implementation of a distributed architecture low density parity check decoder circuit 400 is shown in accordance with one or more embodiments of the present invention. Distributed architecture low density parity check decoder circuit 400 may be used in place of distributed architecture low density parity check decoder circuit 370 of FIG. 3. As shown, distributed architecture low density parity check decoder circuit 400 includes eight data decoder engines 410, 412, 414, 416, 418, 420, 422, 424, 426 and a common circuit 490. Of note, while distributed architecture low density parity check decoder circuit 400 is shown with eight data decoder engines, other embodiments of the present invention may be implemented by more or fewer data decoder engines.

Common circuit 490 includes a syndrome calculation circuit 440, a shifter circuit 450, a convergence check circuit 460, and an instruction read only memory (ROM) 432. Syndrome calculation circuit 440 is operable to calculate a syndrome of a processing codeword using sub-messages 411, 413, 415, 417, 421, 423, 425, 427 passed from each of the respective data decoder engines 410, 412, 414, 416, 418, 420, 422, 424, 426. Syndrome calculation circuit 440 may be implemented similar to any syndrome calculation circuit implemented as part of a low density parity check decoder circuit known in the art. Syndrome calculation circuit 440 operates based upon instructions 434 received from instruction ROM 430. A syndrome calculated by syndrome calculation circuit 440 is provided as a syndrome output 442 to convergence check circuit 460. In addition, syndrome calculation circuit 440 provides a aggregated message 441 (corresponding to decoded output 371 of FIG. 3) which is formed by aggregating the sub-messages 411, 413, 415, 417, 421, 423, 425, 427 from the respective data decoder engines 410, 412, 414, 416, 418, 420, 422, 424, 426 to a shifter circuit 450.

Convergence check circuit 460 uses syndrome output 442 to determine whether application of the data decode algorithm has resulted in the original data set. In one particular embodiment of the present invention, convergence check circuit 460 compares syndrome output 442 with an expected syndrome value. In particular embodiments of the present invention, convergence check circuit 460 operates based upon instructions 432 received from instruction ROM 430. Where syndrome output 442 matches the expected syndrome value, convergence check circuit 460 indicates a converged output 462 to shifter circuit 450, and provides the current state of distributed architecture low density parity check decoder circuit 400 as a decoder output 472 (corresponding to output codeword 372 of FIG. 3). Convergence check circuit 460 may be implemented similar to any convergence check circuit implemented as part of a low density parity check decoder circuit known in the art.

Shifter circuit 450 shifts aggregated message 441 to yield a shifted message set. The shifted message set is then disaggregated into sub-messages 481, 482, 483, 484, 485, 486, 487, 488. Sub-messages 481, 482, 483, 484, 485, 486, 487, 488 are provided back to the respective data decoder engines 410, 412, 414, 416, 418, 420, 422, 424, 426 where converged output 462 does not indicate a convergence, and thus another iteration is needed. In turn, the respective data decoder engines 410, 412, 414, 416, 418, 420, 422, 424, 426 process sub-messages 481, 482, 483, 484, 485, 486, 487, 488.

Turning to FIG. 4 b, one implementation of a data decoder engine 401 is shown in accordance with one or more embodiments of the present invention. Data decoder engine 401 may be used in place of each of data decoder engines 410, 412, 414, 416, 418, 420, 422, 424, 426 of FIG. 4 a. As shown, data decoder engine 401 includes a variable node update processor 403, a variable node update processor 407, a c2v new message circuit 404, a minimum path update circuit 405, and a c2v old message circuit 406. In addition, a central memory portion 402 is shown representing a portion of central memory 350 of FIG. 3 that is assigned to the particular data decoder engine.

In one particular embodiment of the present invention, the central memory portion associated with data decoder engine 410 maintains the first twelve (12) symbols (Symbols 0-11) of log likelihood ratio information; the central memory portion associated with data decoder engine 412 maintains the second twelve (12) symbols (Symbols 12-23) of log likelihood ratio information; the central memory portion associated with data decoder engine 414 maintains the third twelve (12) symbols (Symbols 24-35) of log likelihood ratio information; the central memory portion associated with data decoder engine 416 maintains the fourth twelve (12) symbols (Symbols 36-47) of log likelihood ratio information; the central memory portion associated with data decoder engine 420 maintains the fifth twelve (12) symbols (Symbols 48-59) of log likelihood ratio information; the central memory portion associated with data decoder engine 422 maintains the sixth twelve (12) symbols (Symbols 60-71) of log likelihood ratio information; the central memory portion associated with data decoder engine 424 maintains the seventh twelve (12) symbols (Symbols 72-83) of log likelihood ratio information; and the central memory portion associated with data decoder engine 426 maintains the eighth twelve (12) symbols (Symbols 84-95) of log likelihood ratio information.

Variable node update processor 403 is operable to add a portion 471 of a codeword from central memory portion 402 to a new check node to variable node message 473 to yield an interim output, and performs a delta rearrange of the interim output to yield a sub-message 478. Sub-message 478 is provided as a respective one of sub-messages 411, 413, 415, 417, 421, 423, 425, 427 described above in relation to FIG. 4 a. The delta rearrange process may be any delta rearrange process implemented as part of a low density parity check encoder circuit.

Variable node update processor 407 subtracts a sub-message 479 from an old check node to variable node message 475 to yield an interim output, and normalizes the interim output to an expected range to yield a portion 472 of a codeword stored back to central memory portion 402. Sub-message 479 is one of the respective sub-messages 481, 482, 483, 484, 485, 486, 487, 488 described above in relation to FIG. 4 b.

In addition, variable node update processor 407 provides portion 472 as a minimum path update output 474 to minimum path update circuit 405 that operates to determine a first minimum (referred to as mini representing the smallest distance between nodes on a trellis diagram), a second minimum (referred to as mini representing the next smallest distance between nodes on a trellis diagram), and an index. Minimum path update circuit 405 may be implemented similar to any minimum path update circuit implemented as part of a low density parity check decoder circuit known in the art. Minimum path update circuit 405 provides a determined minimum path output 476 (i.e., Min1/Min2/idx) to c2v new message circuit 404 and to c2v old message circuit 406.

Each of c2v new message circuit 404 and c2v old message circuit 406 update new check node to variable node message 473 and old check node to variable node message 475, respectively. c2v new message circuit 404 may be any circuit known in the art for generating a new check node to variable node message in a low density parity check decoder circuit. Similarly, c2v old message circuit 404 may be any circuit known in the art for generated an old check node to variable node message in a low density parity check decoder circuit.

Returning to FIG. 4 a, the processing of check nodes in each of data decoder engines 410, 412, 414, 416, 418, 420, 422, 424, 426 is independent of processing of the check nodes in all of the other data decoder engines. As such, the need for electrical connection between respective ones of data decoder engines 410, 412, 414, 416, 418, 420, 422, 424, 426 is reduced which in turn reduces the routing congestion in a semiconductor device in which a data decoder circuit is implemented. In the case of data decoder engine 401, each of data decoder engines 410, 412, 414, 416, 418, 420, 422, 424, 426 processes one eighth (⅛th) of the check nodes for the overall codeword, and one eighth (⅛th) of the variable nodes of the overall codeword. In some embodiments of the present invention, the number of electrical connections from any of data decoder engines 410, 412, 414, 416, 418, 420, 422, 424, 426 to another one of the data decoder engines is less than twenty signal wires or routes. As used herein, the phrases “signal wire” or “signal route” are used interchangeably in their broadest sense to mean a connection between electrical components able to carry a signal between the electrical components. In some cases, the number of electrical connections from any of data decoder engines 410, 412, 414, 416, 418, 420, 422, 424, 426 to another one of the data decoder engines is less than ten signal wires or routes. In various cases, the number of electrical connections from any of data decoder engines 410, 412, 414, 416, 418, 420, 422, 424, 426 to another one of the data decoder engines is less than five signal wires or routes. In one particular case, there are not any physical routes or wires from any of data decoder engines 410, 412, 414, 416, 418, 420, 422, 424, 426 to another one of the data decoder engines.

Turning to FIG. 5 an example physical layout 500 facilitated by a distributed architecture low density parity check decoder circuit is shown in accordance with some embodiments of the present invention. As shown, the distributed architecture allows for physical distribution of data decoder engines 510, 512, 514, 516, 518, 520, 522, 524, 526 around the periphery of a common circuit 590. As physical connectivity between common circuit 590 and any given data decoder engine 510, 512, 514, 516, 518, 520, 522, 524, 526 is not extensive, layout and routing involved in implementing a data decoder circuit similar to that discussed above in relation to FIGS. 4 a-4 b is reduced when compared with prior art solutions using a non-distributed architecture. Of note, while FIG. 5 shows data decoder engines 510, 512, 514, 516, 518, 520, 522, 524, 526 to allow for routing connectivity with upstream processing circuitry in a data processing circuit (e.g., an upstream data detector circuit), other layouts are possible in accordance with other embodiments of the present invention. For example, data decoder engines 510, 512, 514, 516, 518, 520, 522, 524, 526 may be equally spaced around the periphery of a common circuit 590. Based upon the disclosure provided herein, one of ordinary skill in the art will recognize other physical layouts that are possible in accordance with different embodiments of the present invention.

In particular embodiments of the present invention, the memory portions included as part of each data decoder engine 510, 512, 514, 516, 518, 520, 522, 524, 526 are placed at the boundary of the overall data decoder circuit to reduce routing complexity caused by electrical connections coming from a data detector circuit side of a data processor.

It should be noted that the various blocks discussed in the above application may be implemented in integrated circuits along with other functionality. Such integrated circuits may include all of the functions of a given block, system or circuit, or a subset of the block, system or circuit. Further, elements of the blocks, systems or circuits may be implemented across multiple integrated circuits. Such integrated circuits may be any type of integrated circuit known in the art including, but are not limited to, a monolithic integrated circuit, a flip chip integrated circuit, a multichip module integrated circuit, and/or a mixed signal integrated circuit. It should also be noted that various functions of the blocks, systems or circuits discussed herein may be implemented in either software or firmware. In some such cases, the entire system, block or circuit may be implemented using its software or firmware equivalent. In other cases, the one part of a given system, block or circuit may be implemented in software or firmware, while other parts are implemented in hardware.

In conclusion, the invention provides novel systems, devices, methods and arrangements for data processing. While detailed descriptions of one or more embodiments of the invention have been given above, various alternatives, modifications, and equivalents will be apparent to those skilled in the art without varying from the spirit of the invention. Therefore, the above description should not be taken as limiting the scope of the invention, which is defined by the appended claims. 

What is claimed is:
 1. A data processing system, the data processing system comprising: a low density parity check data decoder circuit including at least a first data decoder engine and a second data decoder engine each electrically coupled to a common circuit, wherein the first data decoder engine is operable to perform variable node updating for a first portion of a codeword and to generate a first sub-message corresponding to the first portion, and the second data decoder engine is operable to perform variable node updating for a second portion of the codeword and to generate a second sub-message corresponding to the second portion, and wherein the common circuit is operable to: shift a combination of both the first sub-message and the second sub-message to yield a shifted output, and disaggregate the shifted output to yield a third sub-message to the first data decoder engine and a fourth sub-message to the second decoder engine.
 2. The data processing system of claim 1, wherein the first data decoder engine is not electrically connected to the second data decoder engine.
 3. The data processing system of claim 1, wherein the number of signal routes directly between the first data decoder engine and the second data decoder engine is less than twenty.
 4. The data processing system of claim 1, wherein the number of signal routes directly between the first data decoder engine and the second data decoder engine is less than ten.
 5. The data processing system of claim 1, wherein the common circuit is further operable to perform a syndrome calculation on a combination of at least the first portion and the second portion of the codeword.
 6. The data processing system of claim 5, wherein: the first data decoder engine is operable to: generate a first new check node to variable node message; add a first portion of a codeword to the first new check node to variable node message to yield a first interim output; rearrange the first interim output to yield the first sub-message; and provide the first sub-message to the common circuit; the second data decoder engine is operable to: generate a second new check node to variable node message; add a second portion of the codeword to the second new check node to variable node message to yield a second interim output; rearrange the second interim output to yield the second sub-message; and provide the second sub-message to the common circuit.
 7. The data processing system of claim 6, wherein: the first data decoder engine includes: a first minimum path calculation circuit operable to calculate a first minimum path between a present node and a next node; and a first new message update circuit operable to generate the first new check node to variable node message based at least in part on the first minimum path; and the second data decoder engine includes: a second minimum path calculation circuit operable to calculate a second minimum path between a present node and a next node; and a second new message update circuit operable to generate the second new check node to variable node message based at least in part on the second minimum path.
 8. The data processing system of claim 6, wherein the common circuit is operable to: perform the syndrome calculation based at least in part on both the first sub-message and the second sub-message to yield a syndrome value.
 9. The data processing system of claim 8, wherein the common circuit is further operable to determine whether processing of a data set represented at least in part by the first sub-message and the second sub-message converged.
 10. The data processing system of claim 9, wherein processing of the data set is determined to converge when the syndrome value equals a defined value.
 11. The data processing system of claim 10, wherein the defined value is zero.
 12. The data processing system of claim 8, wherein; the first data decoder engine is further operable to: generate a first old check node to variable node message; subtract the third sub-message from the first old check node to variable node message to yield a third interim output; and normalize the third interim output to an expected range; and the second data decoder engine is further operable to: generate a second old check node to variable node message; subtract the fourth sub-message from the second old check node to variable node message to yield a fourth interim output; and normalize the fourth interim output to the expected range.
 13. The data processing system of claim 1, wherein the first data decoder engine and the second data decoder engine are physically layed out in relation to the common circuit such that at least one side of the common circuit is physically accessible by routes from another circuit.
 14. The data processing system of claim 1, wherein the data processing system is implemented as an integrated circuit.
 15. The data processing system of claim 1, wherein the data processing system is incorporated in a device selected from a group consisting of: a storage device, and a data communication device.
 16. The data processing system of claim 1, wherein the data processing system is implemented as part of a data storage device, and wherein the data storage device comprises: a storage medium operable to store information corresponding to a codeword; a head assembly disposed in relation to the storage medium and operable to provide a sensed signal corresponding to the information; a read channel circuit including: an analog front end circuit operable to provide an analog signal corresponding to the sensed signal; an analog to digital converter circuit operable to sample the analog signal to yield a series of digital samples; and an equalizer circuit operable to equalize the digital samples corresponding to the data set to yield the codeword, wherein the codeword includes the first portion of the codeword and the second portion of the codeword.
 17. A method for data processing, the method comprising: providing a distributed architecture data decoder circuit, the distributed architecture data decoder circuit comprising: a low density parity check data decoder circuit including at least a first data decoder engine and a second data decoder engine each electrically coupled to a common circuit, wherein the first data decoder engine is operable to perform variable node updating for a first portion of a codeword and the second data decoder engine is operable to perform variable node updating for a second portion of the same codeword, and wherein the common circuit is operable to perform a syndrome calculation on a combination of at least the first portion and the second portion of the codeword; generating a first new check node to variable node message in the first data decoder engine; adding a first portion of a codeword to the first new check node to variable node message to yield a first interim output in the first data decoder engine; rearranging the first interim output to yield a first sub-message in the first data decoder engine; generating a second new check node to variable node message in the second data decoder engine; adding a second portion of the codeword to the second new check node to variable node message to yield a second interim output in the second data decoder engine; rearranging the second interim output to yield a second sub-message in the second data decoder engine; and providing the second sub-message to the common circuit; shifting a combination of both the first sub-message and the second sub-message to yield a shifted output in the common circuit; and disaggregating the shifted output to yield a third sub-message to the first data decoder engine and a fourth sub-message to the second decoder engine in the common circuit.
 18. The method of claim 17, wherein the number of signal routes directly between the first data decoder engine and the second data decoder engine is less than twenty.
 19. The method of claim 17, wherein the common circuit is operable to: perform the syndrome calculation based at least in part on both the first sub-message and the second sub-message to yield a syndrome value.
 20. A data processing system, the data processing system comprising: a low density parity check data decoder circuit including at least a first data decoder circuit and a second data decoder circuit each electrically coupled to a common circuit, wherein the first data decoder circuit is operable to perform variable node updating for a first portion of a codeword, and the second data decoder circuit is operable to perform variable node updating for a second portion of the codeword, wherein the common circuit is operable to perform an error determination on a combination of at least the first portion and the second portion of the codeword, and wherein the first data decoder circuit and the second data decoder circuit are physically layed out in relation to the common circuit such that at least one side of the common circuit is physically accessible by routes from another circuit.
 21. The data processing system of claim 20, wherein the data processing system is implemented as part of a data storage device, and wherein the data storage device comprises: a storage medium operable to store information corresponding to a codeword; a head assembly disposed in relation to the storage medium and operable to provide a sensed signal corresponding to the information; a read channel circuit including: an analog front end circuit operable to provide an analog signal corresponding to the sensed signal; an analog to digital converter circuit operable to sample the analog signal to yield a series of digital samples; and an equalizer circuit operable to equalize the digital samples corresponding to the data set to yield the codeword, wherein the codeword includes the first portion of the codeword and the second portion of the codeword.
 22. A data processing system, the data processing system comprising: a data decoder circuit including at least: a first means for performing variable node updating for a first portion of a codeword, and generating a first sub-message corresponding to the first portion; a second means for performing variable node updating for a second portion of a codeword, and generating a second sub-message corresponding to the second portion; and a common means for shifting a combination of both the first sub-message and the second sub-message to yield a shifted output, and disaggregating the shifted output to yield a third sub-message to the first data decoder engine and a fourth sub-message to the second decoder engine. 